Batch-Constrained Reinforcement Learning for Dynamic Distribution Network Reconfiguration
Similar resources
Batch Reinforcement Learning
Batch reinforcement learning is a subfield of dynamic programming-based reinforcement learning. Originally defined as the task of learning the best possible policy from a fixed set of a priori-known transition samples, the (batch) algorithms developed in this field can be easily adapted to the classical online case, where the agent interacts with the environment while learning. Due to the effic...
Algorithms for Batch Hierarchical Reinforcement Learning
Hierarchical Reinforcement Learning (HRL) exploits temporal abstraction to solve large Markov Decision Processes (MDP) and provide transferable subtask policies. In this paper, we introduce an off-policy HRL algorithm: Hierarchical Q-value Iteration (HQI). We show that it is possible to effectively learn recursive optimal policies for any valid hierarchical decomposition of the original MDP, gi...
Dynamic Network Formation with Reinforcement Learning
I examine a dynamic model of network formation in which individuals use reinforcement learning to choose their actions. Typically, economic models of network formation assume the entire network structure to be known to all individuals involved. The introduction of reinforcement learning allows us to relax this assumption. Q-learning is a reinforcement learning algorithm from the artificial inte...
Integrating Data Modeling and Dynamic Optimization using Constrained Reinforcement Learning
In this paper, we address the problem of tightly integrating data modeling and decision optimization, particularly when the optimization is dynamic and involves a sequence of decisions to be made over time. We propose a novel approach based on the framework of constrained Markov Decision Processes, and establish some basic properties concerning modeling/optimization methods within this formulat...
Reliability-Constrained Optimal Distribution System Reconfiguration
This work describes a method for reliability improvement of power distribution system via feeder reconfiguration. The work presented here is developed based on a linearized network model in the form of DC power flow and linear programming model in which current carrying capacities of distribution feeders and real power constraints have been considered. The optimal open/close status of the secti...
Journal
Journal title: IEEE Transactions on Smart Grid
Year: 2020
ISSN: 1949-3053,1949-3061
DOI: 10.1109/tsg.2020.3005270